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1 

MANAGING CONDITIONS IN A NETWORK 

BACKGROUND 

The invention relates to managing conditions in a net- 
work. 

In a computer network, problems or other conditions 
associated with one or more network devices frequently pop 
up. To manage and maintain such problems in the network, 
an event log is typically stored on a selected network 
management system. To notify the management system of 
predetermined conditions, a network device sends a message 
(e.g., trap) to the management system. If the management 
server is on and it recognizes the trap condition, then an 
entry is added to the event log. However, if the management 
system is off when the message is transmitted, the message 
is missed and the event log is not updated. 

As the number of devices connected to the network 
increases, the size of the event log also increases. To 
maintain the event log at a manageable size, a network 
administrator must manually remove entries as conditions 
are resolved. 

SUMMARY 

In general, in one aspect, the invention features a network 
system having a network device and a management system 
configured to identify a condition associated with the net- 
work device and to automatically update an event list in 
response to the condition being resolved. 

Implementations of the invention may include one or 
more of the following features. The management system 
polls the network device to determine the status of the 
condition. The network device is configured to transmit a 
message to the management system in connection with an 
occurrence of the condition. The management system auto- 
matically updates the event list by determining if a condition 
on the event list has been resolved, and if so, removing a 
corresponding entry from the event list. 

The invention may have one or more of the following 
advantages. Problems and other conditions kept in a log are 
automatically removed without user intervention. As a 
result, the log reflects the current state of all monitored 
network devices since it only keeps information for condi- 
tions that have not been resolved. Because a management 
system can poll devices for conditions in the network, the 
likelihood that conditions are missed by the management 
system is reduced. 

Implementations of the invention may further include the 
following features. The management system includes a 
storage device for storing a description database identifying 
information associated with the condition. The description 
database can be updated to add more conditions. The 
description database includes an MIB variable associated 
with the condition. 

In general, in another aspect, the invention features a 
computer system for connection to a network. The computer 
system includes a processor, a management program 
executed by the processor to identify a condition on the 
network, and a storage device on which an event list is 
stored. The management program updates the event list in 
response to the condition being resolved. 

In general, in another aspect, the invention features a 
method of managing conditions in a network of network 
devices. A condition on the network is identified, and a 
computer system on the network automatically updates an 
event list in response to the condition being resolved. 
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. In general, in another aspect, the invention features a 
computer program implemented in a computer for causing 
the computer to identify a condition in a network, and to 
automatically update an event list in response to the condi- 

5 tion being resolved. 

In general, in another aspect, the invention features a 
computer-implemented graphical user interface method that 
displays an event list containing an entry associated with an 
identified condition in a network, and automatically updat- 

10 ing the event list in response to the condition being resolved. 
In general, in another aspect, the invention features a 
display that is displayable by a management program on a 
computer monitor. The display has a first screen listing 
conditions associated with devices in a network. The first 

15 screen is updated in response to the conditions being 
resolved. 

Other features and advantages will become apparent from 
the following description and from the claims. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of a computer network. 

FIG. 2 is a block diagram of a management console 
system in the computer network. 
25 ' FIG. 3 is a flow diagram illustrating the steps of a network 
management application running on the management con- 
sole system. 

FIG. 4 is a table listing recognized conditions. 
" FIG. 5 is a table of information and steps associated with 
30 the conditions. 

FIG. 6 shows an action list and device list maintained by 
the network management application. 

FIG. 7 shows an action detail screen describing the 
specific help for an action. 

FIG. 8 shows an action list log screen listing the complete 
history of the action list, 

FIGS. 9A and 9B are a table listing the conditions 
identified by data type and fault type. 
40 FIG. 10 is a block diagram of a computer system such as 
that used to implement the management console system. 

DETAILED DESCRIPTION 

Referring to FIG. 1, an exemplary network 8 (including a 
45 local area network 10) includes computing devices 20, 22, 
24, 26, 28, 30, 32 and 34. The devices can include desktop, 
laptop, hub, and gateway systems. The network 8 can be a 
TCP/IP (transmission control protocol/internet protocol), an 
IPX/SPX (a Novell Netware protocol), or other type proto- 
50 col network. 

A management console system 26 in the network 10 runs 
a network management application 100 (FIG. 2) that main- 
tains an action list of identified problems and other condi- 
tions of network devices. The management application 100 
55 can automatically detect when a condition has been 
resolved, in which case the condition is automatically 
removed from the action list. As a result, the action list 
maintains only conditions that currently exist, which pro- 
vides a manageable list that a network administrator can 
60 quickly look through. A further advantage is that the action 
list does not need to be manually updated continually. 

In addition, the management application 100 maintains a 
description database 116 to identify information associated 
with a condition. Using the description database, the appli- 
es cation can identify the status of a particular condition and 
determine the appropriate tasks to perform to resolve a 
particular condition. 
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Referring to FIG. 2, a network message passed over the rate); utilization levels (e.g., CPU utilization 80%); and 

network 8 and addressed to the management application 100 software version control, in which the management appli- 

is received by a network interface 118 (including a network cat ion monitors software versions on network devices, 

interface controller and its associated firmware) through the reports out of date wftwztt on the action list, and informs 

network connector 120 of the management console system 5 #u . a. ^ • *i ui *l . 

_ ^ . . 6 , I**** the user that a software upgrade is available or that a 

26. The message is passed to a network protocol layer 114 Ci , A , r . , 

(such as a TCP/IP layer), which converts the network f ,ftware u PS rade must be P erformed t0 P revent a cr ^al 

message to a format understood by the operating system 112 ^* 

and the network management application 100. The descrip- The ACTINFO table also includes an Enabled column 

tion database 116 and action list 117 are stored on a storage 10 302 t0 - m tisate whether the particular action is enabled or 

device in the management console system 26. In addition disabkd (y fof enabled flnd N fof ^ bicd y If an action is 

the action list 117 is displayable on a computer display 102 •>>,,,, , . , .. , 

through a video device driver 110 and video interface 108. dlsabled ' thsi . P 3 ^" action is not acted upon by the 

„ . „ , management application 100. An Action Type column 304 

Reternng to HG. 3, to perform an automated network t ■ *u i w * / 

s , y x\j ij-i u auiuuia^ iiutwuiA. categorizes the fault type ot the action (e.g., a mass storage 

maintenance process, the management application 100 at 35 r L j c 1 , • . r _. r , 

« taa c * * . V j . J* j i *%■%£* / • i fault, a system board fault, network interface card fault, a 

step 700 first creates the description database 116 (including c . . r , * 

tables shown in FIGS. 4 and 5). Hie description database software version 0001101 fault > etc >- 

116 stores the types of problems or conditions that are An Update Type column 306 indicates the type of updates 

recognizable by the management application 100, informa- requested for the particular action— whether updates are 

tion to be accessed and processed to determine the status of 20 timed ( per j od i c polling) or performed in response to a trap 

a condition, and tasks to be performed by a user to resolve fr 0m a network device. If the latter, then the management 

the condition. The description database 116 can be easily application 100 updates the action list 117 only when it 

updated by a network administrator to add new types of receives a trap from a network device and does not rfodi . 

problems and conditions that the management application caU issue rfes for [hc ticular action . A &verit 

100 can handle. 25 J r J 

column 3Uo indicates the seventy or the particular action, 

Next, the management application at step 702 waits for with the value t indicating the most severe condition and 

receipt of an event, which can be either a network device higher ya]ues indicating less xwm conditions. A Bitmap ID 

message or a timeout event. The network message can column 310 idemifies a hical usef interface (GU[) 

include a notifica^ c]fimem ( an icQn) ^ ^ action ^ ^ 

a problem or other condition has occurred. The message , . , , , J^. 4l _ „ ' 

•j »•« , u ♦ • i i • o.. j displayed. Thus, the Replace Processor action can be asso- 

ldentines the particular condition or conditions of the send- . f , . . ■/..,■« 

ing device. A periodic timeout event can also be generated clated wlth a s P ecific loon d *P la y ed b X the «"P"ter to the 

in the management application so that it can periodically user 

poll the network devices for problems and other conditions. ^ i n FIG. 5, an ACT_PROC table includes an action 

The management application at step 704 then accesses the number column 400 listing the same action numbers as in 

description database 116 to determine the types of informa- the ACTINFO table in FIG. 4. In addition, the ACT_PROC 

tion to request. If a specific network device trap is received, table includes an Item Name column 404, which lists 

then the management application accesses the description variables, (e.g., Management Information Base or MIB 

database 116 to determine the information associated with 4Q variables according to the SNMP or Simple Network Man- 

the trap condition to request from the sending network age ment Protocol), which are checked by the management 

device. application 100 to determine the status of the particular 

If the event is a timeout, then items identified in the actiorL A Data Source column 406 specifies the particular 

descnption database 116 as requiring periodic updates are of the action> e g ? an SNMP source or other protocol 

addressed. Query messages are sent to the network devices 45 A Check T column 408 mdicates the t of 

to request the required information By using the polling checki ired of ^ condition ^ [n co{umQ m for 

reature, the management application 100 can identify when «■ i «• i i c * • i- , 

A \ . u u i j u. * * * .a a particular action. For example, a value of 1 indicates an 

a condition has been resolved so that information associated „. , „ . ue _ „ „ 

%u *u * u j g ,t • 1 • . rm mdexscan while a value of 2 indicates a full scan .A full 

with that condition can be removed from the action list 117. , , "uiawiu .mun 

VT t , , . scan checks all drives and produces a true result if any 

Next, at step 706, the management applicator, transmits 50 combinalion of lhe devices contains the conditions to be 

requests to the appropriate network device or devices to . , , ™ c , . . 

request information relating to the relevant problems or c u heckcd " for example, if an action is associated with 

conditions. When the requested information is received from three con <^ions and all three must be true (AND operation), 

the network device or devices, the management application then the ful1 scan P roduces a true r ^ult if condition 1 is 

at step 708 checks the information. Next, at step 710, the 55 found 00 device X ' condltlOD 2 15 fc >und on device Y, and 

action list 117 is updated, by either adding entries (if new condition 3 is found on device Z, where X, Y, and Z can be 

problems or conditions are found) or deleting entries (if the tne same or different devices. 

problems or conditions have been resolved). ^ index ^ 0Q the other hand> fequires ^ tfae 

Referring to FIGS. 4 and 5, the description database 116 conditions to be checked must all be located on one device, 

includes two tables. The first table is the ACTINFO table, 60 i n the example above, conditions 1, 2, and 3 must all be on 

which lists action numbers (in column 300) and associated one device for lhe mdGX ^ t0 be lrue 
descriptions (in column 312). For example, action number 

5000 corresponds to a Replace Processor action; action A Comp Type column 410 has a number of different 

number 10004 corresponds to a Check System Temperature values, e.g., 10, 11, 12, 20, 21, 22, or 23, to indicate the type 

action; action number 11001 corresponds to a Degraded 65 of comparison to be performed on the variables listed in 

Drives action; and so forth. Other supported actions include column 404. The different Comp Type values and what they 

threshold levels (e.g., fan speed exceeds a predetermined represent are listed below in Table 1. 
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TABLE 1 



Comp Type Number 


Comparison 


10 


Num = 


11 


Num > 


12 


Num < 


20 


String - 


21 


String > 


22 


String < 


23 


In String 



The comparison is performed between the variable in col- 
umn 404 and the data field in column 412. For example, if 
the Comp Type value is 10, then the management applica- 
tion 100 determines if the particular variable contains a 
number equal to the corresponding value in column 412. As 
another example, a Comp Type Number of 21 would cause 
the management application to determine if the particular 
variable contains a string greater than the corresponding 
specified value. A Comp Type Number of 23 would cause 
the application to determine if a specified value is contained 
in the string in the variable. 

A Not column 414 specifies whether the result of the 
comparison should be inverted; a value of 0 indicates no 
inversion while a value of 1 indicates inversion. An Operator 
column 416 indicates fhe type of operation to be performed 
with another variable associated with the same action num- 
ber. For example, for action number 11000, three variables 
are specified, including cpqDaPhyDrvStatus, 
cpqScsiPhyDrvStatus, and cpqDaSpareCondition. The value 
in the Def Count column 402 indicates the sequence of the 
three variables associated with an action (e.g., 1, 2, 3). A 
value of 2 in the Operator column 416 indicates that an OR 
operation is performed between the variable in the corre- 
sponding row and the variable in the next row associated 
with the same action. A value of 1 would indicate an AND 
operation and a value of 0 indicates a NOP or no operation. 

Thus, the ACT_PROC table specifies the relevant param- 
eters associated with an action and how the parameters are 
to be processed to determine the status of a condition. 

Referring to FIG. 6, an Action List screen 500 displays 
information from the action list 117. The screen 500 shows 
conditions associated with network devices, which are listed 
in a Device List window 502. Exemplary network devices 
are identified as Accounting, Human 13 Factors, Human_ 
Resources, Marketing, Sales, Dave_Smith, Kevin, and 
Mike. In this example, Dave_Smith, Kevin, and Mike are 
desktop units while Accounting, Human_Factors, Human_ 
Resources, Marketing and Sales are servers. 

There are two different ways of displaying the action list 
screen 500, which are to display by action or to display by 
server (as shown by 504). The action list screen provides a 
convenient means of displaying information associated with 
network device conditions to a network administrator. If a 
condition is resolved, the corresponding item is removed 
from the action list. Each action has a priority, as indicated 
by the value in the Severity column 308 in FIG. 4, and 
actions with higher priorities are displayed toward the top of 
the list. 

Referring to FIG. 7, an Action Detail screen 506 provides 
specific help to the user for each action. The purpose of this 
screen is to tell the user exactly what happened, and the steps 
that must be performed to resolve the problem. The action 
details include a description of the problem, specific details 
about the problem, and a list of steps to correct the problem. 
Double clicking on an action in the Action List screen 500 
opens the Action Detail screen. 
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FIG. 8 shows an Action List Log screen 508, which 
contains a complete history of the action list. As items are 
added or removed from the list, they are recorded in the log. 
As actions come and go, a user who is not constantly 

5 watching the screen may miss an important event. The user 
can conveniently refer to the log to determine events that 
have occurred. For example, if the user sets a threshold on 
CPU utilization for 85%, the CPU utilization action will be 
entered into the log as the utilization varies. 

10 The conditions are divided according to three data types: 
action data, status data, and utilization data. Action data 
refers to an action that needs to be taken by the administrator 
to correct a system fault or degraded condition. Status data 
refers to the status of a system component. Utilization data 

35 displays performance or utilization information. In addition, 
as noted above, the conditions are also grouped according to 
fault types: rapid recovery faults, mass storage faults, system 
board faults, network interface card faults, and software 
version control faults. Table 2 (in FIGS. 9A and 9B) lists 

20 exemplary conditions by data type and fault type. 

Referring to FIG. 10, the physical components of the 
management system 26 are illustrated. The system 26 
includes a central processing unit (CPU) 200 connected by 
a CPU-PCI bridge 202 to a Peripheral Component Intercon- 

25 nect (PCI) bus 206. A main memory 204 is connected to the 
CPU 200 and CPU-PCI bridge 202. A mass storage device 
208, in the form of hard disk drives, for example, is 
connected to a SCSI controller 210, which is in turn con- 
nected to the PCI bus 206. A network controller card 212, 

30 which is part of the network interface 102 or 114 described 
in FIG. 2, is also connected to the PCI bus 206. A video 
controller 216 controls signals transmitted between the PCI 
bus 206 and a video display 214. 

An expansion bus 220, such as the Extended Industry 

35 Standard Architecture (EISA) or the Industry Standard 
Architecture (ISA) bus, is connected to the PCI bus 206 
through a PCI-expansion bus bridge 218. The expansion bus 
220 is connected to an input/output (I/O) controller 222, 
which provides interface ports to a keyboard 224, a pointer 

40 device 226 (such as a mouse), and a parallel port 228. The 
I/O controller 222 also controls access to a non-volatile 
random access memory (NVRAM) 230, which can be 
implemented with a flash memory or an EEPROM. The 
NVRAM is used to store the system BIOS for performing 

45 the power up sequence in the computer system 26. Expan- 
sion slots 232 are connected to the expansion bus 220 to 
provide further expansion capabilities. 

Other embodiments are within the scope of the following 
claims. For example, a modified version of the management 

50 application can be loaded on network devices other than the 
management console system 26. In this alternative network 
system, the local management application in each network 
device can obtain and process the information as indicated 
in a locally loaded description database and send that 

5S information to the management console system in place of 
a trap message. 
What is claimed is: 

1. A computer system for connection to a network, 
comprising: 
60 a processor; 

a management program executed by the processor to 
identify a plurality of conditions on the network; and 
a storage device on which an action list and an action log 
are stored, wherein the management program updates 
65 the action list without user intervention in response to 
the condition being resolved, such that any resolved 
operating condition is removed and only unresolved 
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conditions remain on the action list, the management 
program providing a user access to the action list and 
instructions related to correcting a problem condition 
on the action list and wherein the management program 
updates the action log to contain a history of the 
operating conditions of the action list. 

2. The computer system of claim 1, wherein at least one 
network device, connected to the network, transmits a 
message to the management program in connection with an 
occurrence of said plurality of conditions. 

3. The computer system of claim 1, wherein at least one 
network device, connected to the network, transmits a 
message to the management program in connection with an 
occurrence of at least one of said plurality of conditions. 

4. The computer system of claim 1, wherein the storage 
device further stores a description database identifying 
information associated with the plurality of conditions. 

5. The computer system of claim 4, wherein the descrip- 
tion database can be updated to add more conditions. 

6. The computer system of claim 1, further comprising a 
video display on which the action list can be displayed. 

7. The computer system of claim 6, wherein displayed 
entries of the action list are selectable by a user to allow 
further information to be displayed. 

8. A network system comprising: 

a plurality of network devices including a first network 
device; 

a network connecting said plurality of network devices; 

a management system connected to said network, the 
management system configured to identify a plurality 
of operating conditions associated with the plurality of 
network devices, the management system further con- 
figured to produce an action list for the plurality of 
operating conditions which is updated to provide a user 
an accurate status of the plurality of operating condi- 
tions associated with the plurality of network devices, 
said management system further being configured to 
produce an action log containing a history of the 
operating conditions of the action list; and 

a storage device on which the action list and the action log 
are stored, wherein the management system updates the 
action list without user intervention in response to an 
operating condition being resolved such that any 
resolved operating condition is removed and only unre- 
solved operating conditions remain on the action list. 

9. The network system of claim 8, wherein the manage- 
ment system polls the plurality of network devices to 
determine the status of said plurality of operating conditions. 

10. The network system of claim 8, wherein at least one 
of said plurality of network devices is configured to transmit 
a message to the management system in connection with an 
occurrence of at least one of said plurality of operating 
conditions. Q 

11. The network system of claim 8, wherein the manage- 
ment system includes a storage device for storing a descrip- 
tion database identifying information associated with the 
action list. 
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12. The network system of claim 11, wherein the descrip- 
tion database can be updated to add more conditions. 

13. A method for managing conditions in a network, 
comprising the acts of: 

identifying operating conditions of a plurality of devices 

coupled to a network; 
generating an action list and an action log based on the 

identified operating conditions; 
updating the action list and the action log without user 

intervention; 

determining, without user intervention, whether one of the 
identified operating conditions represented on the 
action list has been resolved; and 

removing, without user intervention, any resolved oper- 
ating condition from the action list and recording the 
resolved operation condition on the action log. 

14. The method as recited in claim 13, comprising the act 

of: 

maintaining a database of known operating conditions, 
the database comprising 

information related to each known operating condition; 

accessing the database to determine information rel- 
evant to one of the identified operating conditions; 
and 

requesting the relevant information from the device 
associated with the one of the identified operating 
conditions. 

15. The method as recited in claim 14, wherein the 
database further comprises instructions relevant to resolving 
the known operating conditions, and wherein the method 
comprises the act of: 

providing a user access to the action list and to the 
instructions relevant to resolving the one of the iden- 
tified operating conditions on the action list. 

16. The method as recited in claim 13, comprising the act 

of: 

storing the action list on a storage device. 

17. The method as recited in claim 13, comprising the act 

of: 

displaying the action list on a display device accessible to 
a user. 

18. The method as recited in claim 13, wherein the act of 
identifying operating conditions comprises the act of: 

periodically polling the devices on the network for data 
related to the operating conditions. 

19. The method as recited in claim 13, wherein the act of 
identifying operating conditions comprises the act of: 

receiving messages transmitted by the devices on the 
network, the messages comprising data related to the 
operating conditions. 
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